A Survey on Cross-Lingual Summarization
نویسندگان
چکیده
Abstract Cross-lingual summarization is the task of generating a summary in one language (e.g., English) for given document(s) different Chinese). Under globalization background, this has attracted increasing attention computational linguistics community. Nevertheless, there still remains lack comprehensive review task. Therefore, we present first systematic critical on datasets, approaches, and challenges field. Specifically, carefully organize existing datasets approaches according to construction methods solution paradigms, respectively. For each type dataset or approach, thoroughly introduce summarize previous efforts further compare them with other provide deeper analyses. In end, also discuss promising directions offer our thoughts facilitate future research. This survey both beginners experts cross-lingual summarization, hope it will serve as starting point well source new ideas researchers engineers interested area.
منابع مشابه
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملA survey of cross-lingual embedding models
Cross-lingual embedding models allow us to project words from different languages into a shared embedding space. This allows us to apply models trained on languages with a lot of data, e.g. English to low-resource languages. In the following, we will survey models that seek to learn cross-lingual embeddings. We will discuss them based on the type of approach and the nature of parallel data that...
متن کاملA Survey on Multi-Document Summarization
Multi-document summarization aims at delivering the majority of information content from multiple documents using much less lengthy texts, usually a short paragraph of several hundred words. This paper surveys several different approaches to multi-document summarization by first building a unified high level view of the multi-document summarization problem, and then comparing different approach...
متن کاملEvaluation of Text Summarization in a Cross-lingual Information Retrieval Framework
We report on research in multi-document summarization and on evaluation of summarization in the framework of cross-lingual information retrieval. This work was carried out during a summer workshop on Language Engineering held at Johns Hopkins University by a team of nine researchers from seven universities. The goals of the research were as follows: (1) to develop a toolkit for evaluation of si...
متن کاملComplex Cross-lingual Question Answering as a Sequential Classification and Multi-Document Summarization Task
In this paper, we describe the JAVELIN IV system, which treats complex question answering as a sequential classification and multi-document summarization task. Our research and development effort is based on various forms of linguistic annotation, and a comparison of various answer extraction and summarization algorithms. We discuss the use of different units of extraction, the effect of differ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2022
ISSN: ['2307-387X']
DOI: https://doi.org/10.1162/tacl_a_00520